Learning From Demonstrations via Structured Prediction

نویسندگان

  • Charles Parker
  • Prasad Tadepalli
  • Weng-Keen Wong
  • Thomas Dietterich
  • Alan Fern
چکیده

Demonstrations from a teacher are invaluable to any student trying to learn a given behavior. Used correctly, demonstrations can speed up both human and machine learning by orders of magnitude. An important question, then, is how best to extract the knowledge encoded by the teacher in these demonstrations. In this paper, we present a method of learning from demonstrations that leverages some of the structured prediction techniques currently under investigation in the literature. We report encouraging results in Wargus, a real-time strategy game.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An investigation of imitation learning algorithms for structured prediction

In the imitation learning paradigm algorithms learn from expert demonstrations in order to become able to accomplish a particular task. Daumé III et al. (2009) framed structured prediction in this paradigm and developed the search-based structured prediction algorithm (Searn) which has been applied successfully to various natural language processing tasks with state-of-the-art performance. Rece...

متن کامل

Robobarista: Object Part Based Transfer of Manipulation Trajectories from Crowd-Sourcing in 3D Pointclouds

There is a large variety of objects and appliances in human environments, such as stoves, coffee dispensers, juice extractors, and so on. It is challenging for a roboticist to program a robot for each of these object types and for each of their instantiations. In this work, we present a novel approach to manipulation planning based on the idea that many household objects share similarly-operate...

متن کامل

Robobarista: Learning to Manipulate Novel Objects via Deep Multimodal Embedding

There is a large variety of objects and appliances in human environments, such as stoves, coffee dispensers, juice extractors, and so on. It is challenging for a roboticist to program a robot for each of these object types and for each of their instantiations. In this work, we present a novel approach to manipulation planning based on the idea that many household objects share similarly-operate...

متن کامل

Predict and Constrain: Modeling Cardinality in Deep Structured Prediction

Many machine learning problems require the prediction of multi-dimensional labels. Such structured prediction models can benefit from modeling dependencies between labels. Recently, several deep learning approaches to structured prediction have been proposed. Here we focus on capturing cardinality constraints in such models. Namely, constraining the number of non-zero labels that the model outp...

متن کامل

Bipedal Locomotion Primitive Learning, Control and Prediction from Human Data

At the current stage bipedal robot locomotion is quite different from human walking. Imitation learning framework from human demonstrations is an efficient approach to lead towards human-like behaviors. This paper addresses a framework for real-time wholebody human motion imitation by a humanoid robot. The framework is a structured mixture of whole body motion control, learning and prediction. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007